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ABSTRACT 

We obtained deep 8.5' x 8.5' near-infrared images within four high-redshift survey fields, measured 
the K s magnitudes of 300 optically selected galaxies with spectroscopic redshift 1.8 ^ z ^ 2.6, and 
compared the spatial clustering strength of galaxies with K s < 20.5 and K s > 20.5. We found at 
greater than 95% confidence that the brighter galaxies cluster more strongly. The best-fit correlation 
lengths for the bright and faint samples are 10 ± 3 and 4 ± 0.8ft. -1 comoving Mpc, respectively (lcr), 
although the unusual density of bright QSOs in one of our survey fields may imply that the result is 
not representative of the universe as a whole. Neglecting this possibility, the correlation length for 
the optically selected sample with K s < 20.5 agrees well with that reported for comparably bright 
near-IR-selected samples. The differences in correlation length between optically selected and near-IR- 
selected samples have been presented as evidence that the two techniques find orthogonal populations 
of high-redshift galaxies. Our results favor a more nuanced view. 

Subject headings: galaxies: high-redshift — cosmology: large-scale structure of the universe 



1. INTRODUCTION 

Near-infrared (IR) surveys of the high-redshift uni- 
verse (Cimatti et at. 2002, Franx et al. 2003, Abra- 
ham et al. 2004) were designed to find passively evolv- 
ing galaxies that had been missed by optical surveys, but 
they also uncovered an intriguing population of galaxies 
whose star- formation rates M* £ 200M Q yr _1 and stel- 
lar masses M* J,2x 10 11 M Q dwarfed those of optically- 
selected galaxies. Since these star-forming galaxies have 
the properties expected for the progenitors of giant el- 
liptical galaxies, many researchers have begun to assume 
that near-IR surveys find the progenitors of ellipticals, 
that optical surveys find the progenitors of less massive 
local galaxies, and (by extension) that the high-redshift 
universe is divided into two distinct star-forming popu- 
lations with divergent evolutionary paths. 

Although some of the near-IR-selected galaxies clearly 
do not satisfy common optical selection criteria (see, e.g., 
Daddi et al. 2004; Forster-Schreiber et al. 2004; van 
Dokkum et al. 2004), the evidence for two distinct star- 
forming populations still leaves room for doubt. First, 
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the differences in mean stellar masses and star-formation 
rates might simply reflect the fact that near-IR surveys 
tend to be shallower than optical surveys, detecting only 
the brightest tip of the luminosity distribution. Since the 
near-IR-selected star-forming galaxies are less numerous 
than optically selected galaxies, a significant fraction of 
them could be present in optical surveys without chang- 
ing the mean properties of optical sources in an appre- 
ciable way. The unusual measured properties of bright 
near-IR galaxies do not imply that they are absent from 
optical surveys any more than the high mean income of 
Luxembourgers implies that they are a previously undis- 
covered population in the E. U. Second, sub-millimeter 
surveys show that the majority of the most rapidly star- 
forming galaxies at high redshift are bright enough and 
blue enough to be included in existing optical surveys 
(e.g., Chapman et al. 2005; see their Figure 6). Finally, 
there are physical reasons to believe that at least some 
of the elliptical galaxies in the local universe descended 
from the objects detected in optical surveys. Ellipticals 
are the only local galaxies that have the spatial distri- 
bution expected for the descendants of optically selected 
star- forming galaxies at z > 1.5, for example (see Adel- 
berger et al. 2005). 

Partly in order to help clarify the connection between 
optical and near-IR populations, our group has obtained 
K s images throughout the course of its optical surveys 
at 1.4 < z < 3.5. Recent analyses of the K s images 
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(Erb et al. 2005, in preparation) reveal that many 
optically-selected high-redshift galaxies have the mag- 
nitudes K s < 20 characteristic of the supposedly dis- 
tinct bright near-IR-selected populations. Shapley et al. 
(2004) showed that these optically-selected galaxies with 
K s < 20 have star-formation rates, stellar masses, and 
metallicities similar to those of star-forming galaxies in 
near-IR-selected surveys to K s — 20. This suggested 
that there might be substantial overlap between near- 
IR and optical populations at high redshift, a conclusion 
reinforced by their estimate that the number density of 
high-redshift galaxies with K s < 20 in optically-selected 
surveys is at least half as large as the number density re- 
ported by near-IR surveys. Daddi et al. (2004, 2005) 
reached a similar conclusion from a different starting 
point. 

The present paper extends the work of Shapley et 
al. (2004) in a small way by showing that optically- 
selected galaxies at z ~ 2 with bright K s magnitudes 
cluster as strongly as their near-IR-selected counterparts. 
This suggests that the if a -bright star-forming galaxies 
detected in optical surveys may not be enormously dif- 
ferent from the if s -bright star-forming galaxies that are 
missed. We will discuss the implications further in § 
after first describing our data in § [21 our method for mea- 
suring ro in § |3 and our raw results in § 0] 

2. DATA 

The galaxies we studied lie within the fields GOODS- 
N, Q1623, Q1700, and Q2343 of the spectroscopic survey 
of Steidel et al. (2004). Steidel et al. (2004) obtained 
U n G1Z images of these fields, constructed a catalog of 
objects with AB-magnitude 1Z < 25.5 whose colors sat- 
isfied the z ~ 2 "BX" selection criteria of Adelberger et 
al. (2004), measured spectroscopic redshifts for several 
hundred of the objects in this catalog, and, after exclud- 
ing objects with z < 1, found that the mean redshift and 
r.m.s. dispersion z±a z = 2.21 ±0.34 agreed well with the 
range targeted by the photometric selection criteria. To 
measure the near-IR magnitudes of some of the galaxies 
in the catalog, we (Erb et al. 2005, in preparation) sub- 
sequently used the Wide-Field Infrared Camera (WIRC; 
Wilson et al. 2003) on the Hale 5m telescope to obtain a 
deep K s image (~ llhr integration time, 5a point-source 
detection threshold of K s ~ 22.3 in the Vega system) of 
a roughly 8.5' x 8.5' region within the larger optically- 
observed area in each of the four fields. A total of 1598 
objects in the photometric BX catalogs of Steidel et al. 
(2004) fell within the areas observed with WIRC. We re- 
strict our analysis to the 368 of these sources that have 
measured redshifts. Most (300) of the sources with red- 
shifts were detected in K s at 3tr significance or better, 
and 75 were brighter than the K s = 20.5 threshold that 
we use below to divide our sample into bright and faint 
subsamples. See tabled all near-IR magnitudes here and 
elsewhere are in the Vega system, and all optical magni- 
tudes are AB. K s = 20.5 was chosen because it was the 
brightest threshold that left us with enough objects in 
the bright subsample; only 20 objects satisfy the thresh- 
old K s = 20. The redshift distributions for the bright 
and faint subsamples are shown in figure Q 

The total number of galaxies in our subsamples would 
be significantly higher if our spectroscopy were 100% 
complete. Since the fraction of spectroscopically ob- 
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Fig. 1. — Redshift distributions for the galaxies in our samples. 
The top panel shows the raw number of galaxies with K s < 20.5 
and K s > 20.5 (Vega system). The bottom panel shows the same 
data normalized to constant area for z > 1. The overall redshift 
distribution for the "BX" survey of Steidel et al. (2004) is shown 
as a smooth curve in the background. The low redshift tail with 
z < 1 is excluded from the clustering analysis of this paper. 



Table 1. Observed fields 



Name 


AQ pt a 


AO m b 


xrTR c 

img 


Jv dct 


^20.5 


GOODS-N 


14.0 x 13.4 


8.7 x 8.0 


84 


60 


10 


Q1623 


16.1 x 11.6 


8.7 x 8.8 


121 


98 


30 


Q1700 


11.5 x 11.0 


8.5 x 7.8 


62 


57 


13 


Q2343 


22.5 x 8.5 


8.7 x 5.4 


101 


85 


22 


total 


695 


259 


368 


300 


75 



a Area imaged in U n G1Z and observed spectroscopically, square 
arcmin 

b Area imaged in K s and observed spectroscopically, square ar- 
cmin 

c Number of sources in the BX catalog that fall within the K s 
image and have spectroscopic redshifts z > 1. 

d Number of sources in the BX catalog that are detected in the 
K s image at 3cr confidence and have spectroscopic redshifts z > 1. 

c Number of sources in the BX catalog with K s < 20.5 (Vega 
system) and spectroscopic redshifts z > 1. 



served BX candidates that have 1.8 < z < 2.8 is roughly 
0.8, 0.65, 0.5 for K s > 20.5, K s < 20.5, K s < 20, we 
estimate from the apparent magnitude distribution of all 
BX candidates that the total number of BX candidates 
in our fields with 1.8 < z < 2.8 is 1000, 200, 80 for 
the same three magnitude ranges. Although our spec- 
troscopy is far from complete, our spectroscopic catalog 
of galaxies with K s < 20.5 is already about an order of 
magnitude larger than any near-IR selected catalog in 
the same redshift range. 

3. METHODS 

We estimated the correlation lengths of the samples 
with two approaches. Both were based on counting 
the observed number of galaxy pairs with comoving ra- 
dial separation |AZ| < i and comparing to the num- 
ber expected for a reduced correlation function of the 
form £(r) = i r / r o) ~ 1 ■ Letting Zi denote the comov- 
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ing distance to redshift Zi and adopting the shorthand 
Zij = Zi — Zj , we counted the observed number of pairs 
in the each field with radial separation < | AZ\ < I and 
summed over fields to get A o b s (0,^), the total number of 
pairs with < \AZ\ < £. 

If the selection function P(z) is accurately known, the 
expected number of pairs with < \AZ\ < £ can be 
estimated for any assumed value of ro. Suppose we know 
that the jth galaxy in a field has redshift Zj and that the 
ith galaxy lies at an angular separation of 9ij. Then 
the probability that the unknown redshift Zi will satisfy 
< \Z i:j \ <£is (Adelberger 2005) 



p(o< \Zij\ <£\z j e ij ) = 



P{z 3 )[2£g-\ Zj ) + al] 
1 + aP( Zj ) 



(1) 



where a = rl[f {z^dij} 1 ~>g 1 (z j )0('y), g(z) = c/H[z) is 
the change in comoving distance with redshift, f(z) = 
(1 + z)Da(z) is the change in comoving distance with 
angle, Da(z) is the angular diameter distance, f3(j) = 
B[l/2, (7— 1)/2], B is the beta function in the convention 
of Press et al. (1992), and T is related to the incomplete 
beta function I x of Press et al. (1992) through X = 
7 x [l/2,( 7 - l)/2] with x = £ 2 /{£ 2 + [f{z 3 )9 l3 ] 2 }. The 
expected number of pairs with < \AZ\ < £ is equal to 
the sum of the probabilities P(0 < \Zij\ < £\zj8j) for 
each unique pair, 



N,- 



pairs 



< \Zii \ <£\zi 



(2) 



where the sum includes only pairs in which both galaxies 
lie in the same field. 

Our first approach towards estimating r is to find the 
value that makes the expected number of pairs A exp (0, £) 
equal the observed number N o b s (0,£). By summing over 
the pair counts in all fields when calculating A oxp and 
iV bs) we are implicitly assuming that the selection func- 
tion P(z) does not vary appreciably from one field to the 
next. The estimated values of ro will be incorrect if this 
is not the case, but the analysis is otherwise insensitive 
to variations in data quality from field to field. 

Even if the selection function is reasonably constant 
among the fields, as it probably is, the inferred value 
of ro will still depend sensitively on its assumed shape 
(Adelberger 2005). This is the major weakness of equa- 
tion 13 An alternative is to remove the sensitivity to the 
selection function by dividing the number of pairs with 
< AZ <£by the number with < AZ < 2£. As shown 
by Adelberger et al. (2005) , the expectation value of this 
quotient is 



A obs (0,^) 



N oha (0,2£)/ ^C^ + diR^Z)] 



(3) 



where £(R,Z) = £[(i? 2 + Z 2 ) 1 / 2 ] and R tJ = (1 + 
Zi)DA(zi)9ij, regardless of the selection function, as 
long as the angular diameter distance changes slowly 
with redshift, £ is significantly larger than the comov- 
ing uncertainty in a galaxy's radial position, 2£ is sig- 
nificantly smaller than the selection function's width, 
and N ohs (0,2£) is large enough that (1/A obs (0, 2£)) ~ 
l/(A ? o bs(0, 21)). Our second approach to estimating r is 
to find the value that makes the right-hand side of equa- 
tion|3]equal the observed quantity A o b s (0,^)/A o b s (0, 21). 



The estimate of ro from this approach is barely affected 
by even major changes in data quality from one field to 
the next. 

Although this insensitivity is a significant advantage, 
the disadvantage of the second approach is that it can 
be noisy. We adopt both approaches because one is less 
susceptible to systematic errors and the other is less sus- 
ceptible to random. See Adelberger (2005) for further 
discussion. 

In both approaches we adopt £ — 20h~ 1 Mpc, consid- 
erably larger than the biggest plausible peculiar velocity 
or redshift measurement error, and assume that the cor- 
relation function has the slope of 7 = 1.6 measured from 
a larger sample of similar galaxies by Adelberger et al. 
(2005). Adopting 7 = 1.8 instead would lower our de- 
rived correlation lengths by less than 10%. Eliminating 
pairs with B i3 > 300" (e.g., Adelberger 2005) does not 
significantly alter the results. 

4. RESULTS 

Equations [21 and [3| lead to the estimates ro = 3.8, 
4.4/i _1 comoving Mpc for the subsample with K s > 20.5 
and and ro = 9.9, 10.8ft- -1 comoving Mpc for the sub- 
sample with K s < 20.5. To calculate the significance of 
the observed difference between the two samples' correla- 
tion lengths, we split our full 368-object sample into two 
parts at random, rather than according to K s magnitude, 
measured the difference in ro between the two parts, then 
repeated the process over and over, with a different ran- 
dom splitting each time, to estimate the distribution of 
Aro under the null hypothesis that ro is unrelated to K s 
magnitude. Each random splitting placed 75 of our 368 
objects in one subsample and 293 in another, mimicking 
the 75/293 division of the actual splitting at K s = 20.5. 
We observed a difference in ro as large as the observed 
difference in fewer than 5% of the randomized catalogs, 
implying that the observed difference is significant at bet- 
ter than 95% confidence. 

We used two approaches to estimate the size of the 
uncertainties in the correlation lengths for the two sam- 
ples. First, we measured the dispersion in the the best-fit 
value of ro from each individual field. Although in princi- 
ple this empirical approach should work well, in practice 
ro is poorly determined in individual fields and the field- 
to-ficld dispersion in ro is poorly determined since there 
are only four fields. 

Second, we broke our observed bright and faint cata- 
logs into many smaller sub-catalogs by rejecting a ran- 
dom fraction p = 0.5-0.9 of the sources, observed how the 
dispersion in the best-fit ro value among the sub-catalogs 
depended on the number of objects in the catalog, and 
extrapolated to the full catalog sizes. 

We adopt compromise values for the standard devia- 
tion in our final estimates for the lcr confidence intervals 
for the two analyses (eq. [5] and 0J) of the two catalogs, 
r = 3.8 ±0.8, 4.4±0.8/i" 1 comoving Mpc for K s > 20.5 
and r = 10 ±3, ll±8/i -1 comoving Mpc for K s < 20.5. 
The consistency of the answers for the two approaches 
suggests that systematic problems are minimal. We will 
adopt the the results from eq. |2for the remainder of the 
paper, since that equation is subject to smaller random 
uncertainties. 

To verify that the high value of ro for the bright sample 
reflected genuine large-scale clustering, rather than (say) 
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Fig. 2. — Left panel: correlation lengths for sub-samples with 
Vega-magnitude K 3 < 20.5, 20.5 < K s < 21.5, and K s > 21.5. 
The number of galaxies in the three subsamples is 75, 145, and 148, 
respectively. The values shown are from equation [3] Large circles 
are for the full sample; their uncertainties in ro were estimated 
with the approach of § |I] Small squares and crosses show the 
results for Q1623 and for the other three fields, respectively. Right 
panel: Estimated stellar mass vs. K s magnitude for galaxies in the 
sample of Steidel et al. (2005). Small circles show the results for 
individual galaxies. Large circles with error bars show the mean ± 
r.m.s. for galaxies in different magnitude bins. 



the presence of merging sub-units within individual ha- 
los, we repeated the analysis after excluding all pairs with 
9ij < 60" (i.e., projected separation 1.1ft -1 comoving 
Mpc) from the calculation in equations [5] and [3J The 
best-fit values of ro were not significantly altered. 

5. DISCUSSION 

Our analysis of the largest existing spectroscopic sam- 
ple of galaxies with K s < 20.5 at redshift z ~ 2 confirms 
previous reports (Daddi et al. 2004) that these galax- 
ies are strongly clustered. Their estimated correlation 
length of ro = 10±3ft -1 Mpc significantly exceeds that of 
normal galaxies at high or low redshift (e.g., Adelberger 
et al. 2005, Budavari et al. 2003). The exact value of the 
correlation length at K s < 20.5 may be more uncertain 
than this number implies, since the Daddi et al. (2004) 
sample is so small (see Adelberger 2005) and since much 
of our signal comes from a field (Q1623; see figure |5J) 
which contains an unusual concentration of bright QSOs 
at 2.2 z < 2.6 4 , but the increase of clustering strength 
with K s luminosity has been established with high sig- 
nificance (> 95%; see §0J. 

The results of this paper, coupled with those of Shap- 
ley et al. (2004), suggest that star-forming galaxies in 
optically-selected and near-IR-selected high-redshift sur- 
veys have similar properties when the surveys are re- 
stricted to a common magnitude limit K s ^ 20.5. Galax- 
ies with K s < 20.5 undoubtedly make up a larger fraction 
of the sources in the near-IR surveys, but they do not 



seem to be a fundamentally different population. The 
left panel of figure shows, for example, that their large 
correlation lengths reflect the culmination of a trend that 
is apparent among the fainter galaxies that dominate op- 
tical surveys. Although existing optical surveys do not 
detect every star-forming galaxy with K s < 20.5, the 
fraction they find appears to be reasonably representa- 
tive of the whole. 

Since K s and 1Z — K s are correlated in our sample, 
the relationship between ro and K s implies relationship 
between ro and 1Z — K s . We find that ro increases 
from 4.0 ± 0.8ft -1 Mpc at K(AB) - K s (Yeg&) < 3.5 to 
ro = 9 ± 3ft -1 Mpc at 7Z — K s > 3.5. A correlation of ro 
with both K s and 1Z — K s could result from an under- 
lying correlation between ro and stellar mass M*, since 
7Z — K s and K s are both correlated with M* (Shapley et 
al. 2005, in preparation). An underlying correlation with 
M* could also explain why ro varies so strongly with K s . 
Spitzer Space Telescope observations analyzed by Shap- 
ley et al. (2005, in preparation) show that mass-to-light 
ratios in our sample increase significantly with K s lumi- 
nosity, which implies that M* is a strong function of K s 
(see the right panel of figure [21 . The very high stellar 
masses associated with K s < 20.5 can only be produced 
in unusually massive potential wells that formed unusu- 
ally early, and these are exactly the potential wells that 
ought to be most strongly clustered. The increase of 
ro with i^-luminosity may simply reflect the fact that 
galaxies in proto-cluster environments formed earlier and 
were consequently able to convert more of their baryons 
into stars by z ~ 2. Steidel et al. (2005, in preparation) 
discuss the point in more detail. 

The strong dependence of galaxy properties on K s 
magnitude can be viewed in two ways. On the one hand, 
since galaxies with K s ^ 20 are not representative of the 
high-redshift population as a whole, conclusions from 
bright near-IR-selected surveys probably apply to only 
a limited subset of galaxies. These galaxies are not irrel- 
evant, since they are plausible progenitors for the rarest 
and most massive ellipticals in the local universe, but 
their properties should not be ascribed to the dominant 
fainter population. On the other hand, the extreme prop- 
erties of galaxies with K s ^ 20 may help reveal trends 
that are difficult to discern in fainter samples, and these 
trends could provide useful insight into the physics of 
galaxy formation. Exploring the second possibility will 
require extensive catalogs of high redshift galaxies with 
K s 20. The ability of optical surveys to find them in 
large numbers is surely good news. 
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4 These QSOs are not in the BX sample. The sample analyzed nQt bc a fair samplc of thc univ sincc onc of our fields 

here contains only 3 QSOs^ and their clustering does not signifi- contains one of the richest known concentrations of bright QSOs. 

cantly affect the results. The point is that our surveyed volume 
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